Basic Statistics

Raw Counts

Name                    Value
Rows                    122,635
Columns                 31
Discrete columns        23
Continuous columns      8
All missing columns     0
Missing observations    58,793
Complete Rows           67,139
Total observations      3,801,685
Memory allocation       33.3 Mb
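
The raw counts are linked by simple arithmetic, and the sketch below checks those links and derives the shares that follow from them. The figures are copied from the table. The choice of denominators (columns for column types, rows for complete rows, all cells for missing observations) is an assumption about how such shares are usually reported, not something the report states.

```python
# Raw counts copied from the table above.
rows = 122_635
columns = 31
discrete_columns = 23
continuous_columns = 8
missing_observations = 58_793
complete_rows = 67_139
total_observations = 3_801_685

# Total observations is the number of cells: rows x columns.
assert rows * columns == total_observations
# Every column is counted as either discrete or continuous.
assert discrete_columns + continuous_columns == columns


def pct(part, whole):
    """Return part / whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Assumed denominators: columns, rows, or all cells, depending on the metric.
shares = {
    "Discrete columns": pct(discrete_columns, columns),
    "Continuous columns": pct(continuous_columns, columns),
    "Complete rows": pct(complete_rows, rows),
    "Missing observations": pct(missing_observations, total_observations),
}

# Rows that contain at least one missing value.
incomplete_rows = rows - complete_rows

for name, value in shares.items():
    print(f"{name}: {value}%")
print(f"Rows with at least one missing value: {incomplete_rows:,}")
```

Missing cells are a small fraction of all observations, yet they are spread over enough rows that only about half of the rows are complete.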

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (by frequency)

## 8 columns ignored with more than 50 categories.
## accident_index: 122635 categories
## police_force: 51 categories
## date: 365 categories
## time: 1439 categories
## local_authority_district: 380 categories
## local_authority_highway: 207 categories
## lsoa_of_accident_location: 27966 categories
## datetime: 86933 categories
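
The message above reflects a cardinality cut-off: columns with more than 50 distinct values are left out of the bar charts. A minimal sketch of such a filter follows, assuming that missing values are not counted as a category. The function name `split_by_cardinality`, the `maxcat` argument, and the toy data (including the `day_of_week` column) are hypothetical and are not taken from the report.

```python
def split_by_cardinality(table, maxcat=50):
    """Split columns into those kept and those ignored for plotting.

    table maps a column name to its list of values. A column is ignored
    when it has more than maxcat distinct non-missing values.
    Returns (kept_names, ignored) where ignored maps name -> category count.
    """
    kept, ignored = [], {}
    for name, values in table.items():
        # Assumption: None marks a missing value and is not a category.
        n_categories = len({v for v in values if v is not None})
        if n_categories > maxcat:
            ignored[name] = n_categories
        else:
            kept.append(name)
    return kept, ignored


# Toy data for illustration only; not the data behind the report.
toy = {
    "accident_index": [f"A{i}" for i in range(200)],  # unique per row
    "day_of_week": [i % 7 for i in range(200)],       # 7 categories
    "police_force": [i % 51 for i in range(200)],     # 51 categories
}

kept, ignored = split_by_cardinality(toy, maxcat=50)

print(f"## {len(ignored)} columns ignored with more than 50 categories.")
for name, n_categories in ignored.items():
    print(f"## {name}: {n_categories} categories")
```

An identifier column such as `accident_index` has one category per row, so it always exceeds any reasonable cut-off.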

QQ Plot

Correlation Analysis

## 8 features with more than 20 categories ignored!
## accident_index: 67139 categories
## police_force: 43 categories
## date: 365 categories
## time: 1433 categories
## local_authority_district: 348 categories
## local_authority_highway: 175 categories
## lsoa_of_accident_location: 22911 categories
## datetime: 54197 categories
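
The category counts in this list are lower than those reported for the bar charts, and the count for `accident_index` (67139) equals the number of complete rows in the raw counts. This suggests, though the report does not say so, that rows with missing values were dropped before the categories were counted. The sketch below illustrates that effect under this assumption. The helper names and the toy records (including the `speed_limit` column) are hypothetical.

```python
def complete_rows(records):
    """Keep only records (dicts) in which no value is None."""
    return [r for r in records if all(v is not None for v in r.values())]


def category_counts(records):
    """Count distinct non-missing values per column in a list of dicts."""
    seen = {}
    for record in records:
        for name, value in record.items():
            if value is not None:
                seen.setdefault(name, set()).add(value)
    return {name: len(values) for name, values in seen.items()}


# Toy records for illustration only; None marks a missing value.
records = [
    {"accident_index": "A1", "police_force": 1, "speed_limit": 30},
    {"accident_index": "A2", "police_force": 2, "speed_limit": None},
    {"accident_index": "A3", "police_force": 1, "speed_limit": 60},
    {"accident_index": "A4", "police_force": 3, "speed_limit": None},
]

before = category_counts(records)
after = category_counts(complete_rows(records))

print("all rows:     ", before)
print("complete rows:", after)
```

Dropping incomplete rows can only keep or reduce the number of categories in a column, which is consistent with `police_force` falling from 51 to 43 categories. At 43 categories it still exceeds a cut-off of 20 but not a cut-off of 50.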

Principal Component Analysis

## 7 features with more than 50 categories ignored!
## accident_index: 67139 categories
## date: 365 categories
## time: 1433 categories
## local_authority_district: 348 categories
## local_authority_highway: 175 categories
## lsoa_of_accident_location: 22911 categories
## datetime: 54197 categories